Hierarchical Landmark Policy Optimization for Visual Indoor Navigation

نویسندگان

چکیده

In this paper, we study the problem of visual indoor navigation to an object that is defined by its semantic category. Recent works have shown significant achievements in end-to-end reinforcement learning approach and modular systems. However, both approaches need a big step forward be robust practically applicable. To solve insufficient exploration scenes make more semantically meaningful, extend standard task formulation give agent easily accessible landmarks form room locations those types. The availability allows build hierarchical policy structure achieve success rate 63% on validation photo-realistic Habitat simulator. hierarchy, low level consists separately trained RL skills high deterministic policy, which decides skill needed at moment. Also, show possibility transferring real robot. After bit training reconstructed scene, robot shows up 79% SPL when solving navigating arbitrary object.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Visual Landmark Framework for Indoor Mobile Robot Navigation

This article presents vision functions needed on a mobile robot to deal with landmark-based navigation in buildings. Landmarks are planar, quadrangular surfaces, which must be distinguished from the background, typically a poster on a wall or a door-plate. In a first step, these landmarks are detected and their positions with respect to a global reference frame are learned; this learning step i...

متن کامل

Visual Landmark Selection for Mobile Robot Navigation

A large number of landmarks selection techniques has been proposed. However, finding optimal solutions requires to solve some hard problems. In this paper, we consider the ρminimum overlapping region decomposition problem that was proposed for landmarks selection. This problem is NP-complete. We describe an approach to solve the problem optimally. This approach is based on an explicit reduction...

متن کامل

Visual Nouns for Indoor/Outdoor Navigation

We propose a local orientation and navigation framework based on visual features that provide location recognition, context augmentation, and viewer localization information to a human user. Mosaics are used to map local areas to ease user navigation through streets and hallways, by providing a wider field of view (FOV) and the inclusion of more decisive features. Within the mosaics, we extract...

متن کامل

State-based SHOSLIF for indoor visual navigation

In this paper, we investigate vision-based navigation using the self-organizing hierarchical optimal subspace learning and inference framework (SHOSLIF) that incorporates states and a visual attention mechanism. With states to keep the history information and regarding the incoming video input as an observation vector, the vision-based navigation is formulated as an observation-driven Markov mo...

متن کامل

Visual Landmark Navigation Through Large-scale Environments

Several theoretical models have been proposed for the visual homing behaviour of insects. We take one such model for visual homing, and extend it to an algorithm capable of autonomous exploration and navigation through large-scale environments. The algorithm uses a novel approach to waypoint selection during the construction of multi-leg routes. By locating the boundaries between visual locales...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2022

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2022.3182803